Appendix A Additional figures to supplement the methods and results 
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Fig Al. The observed log-log species-area relationships (circles) across the 16 sites relative to 
the recursive (red line) and non-recursive (blue line) formulations of METE and the simulated 
semi-recursive METE (black line) when the SAD is inferred from So and No- The 95% 
confidence interval for the semi-recursive prediction is displayed as a grey envelope. A semi- 
recursive prediction for the Cross Timbers community is not available due to this site's low 
richness. 
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Fig A2. The observed arithmetic species-area relationships (circles) across the 16 sites relative to 
the recursive (red line) and non-recursive (blue line) formulations of METE and the simulated 
semi-recursive METE (black line) when the SAD is inferred from So and No. The 95% 
confidence interval for the semi-recursive prediction is displayed as a grey envelope. A semi- 
recursive prediction for the Cross Timbers community is not available due to this site's low 
richness. 
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F/g A3. This diagram is an extension of Fig. 1 in the main text and its purpose is to further 
illustrate the "user rules" of how a landscape can be bisected to different degrees (columns, i- 
values) and how samples are compared for different separation orders (rows^'-values). In this 
specific example, three bisections are used to generate a spatially explicit distribution of 10 
individuals. The red arrows illustrate which cells would be compared for a given separation 
order, j. Every possible separation order is illustrated for each level of bisection. 
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Fig A4. The log-log transformed observed and predicted distance decay relationships for each 
site at a single spatial grain (identical to Fig. 2 in the main text but with log-log axes). 
Community similarity represents the average of the abundance-based S0rensen index for each 
spatial lag. The spatial grain displayed was taken at either 8 or 9 bisections of the total area 
depending on whether the total extent was a square or a rectangle respectively. Geographic 
distance was calculated as the average physical distance between the samples compared at given 
separation order (see Methods and Fig. A3 for additional information). 
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Fig A5. The arithmetic DDR based upon medians (rather than averages) of the abundance-based 
S0rensen index. The observed (black line with dots) and predicted distance decay relationships 
(METE: dark grey lines, solid for the observed SAD, dashed for the METE SAD; random 
placement: light grey line) for each site at a single spatial grain. The spatial grain displayed was 
taken at either 8 or 9 bisections of the total area depending on whether the total extent of a site 
was a square or a rectangle respectively. Geographic distance was calculated as the average 
physical distance between the samples compared at given separation order (see Methods and Fig. 
A3 for additional information). 
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Fig A6. Kernel density lines for the coefficient of determination, R 2 , for the fitted power (black 
line) and exponential models (grey line) for the predicted METE distance decay relationship 
(DDR, left panel) and for the empirical DDRs (right panel). Both were best fit by a power model 
of DDR. The i? 2 -values were computed using weighted least- squares regression where the 
weights were assigned based on the number of comparisons at a given spatial lag. 
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Fig A7. The DDR residuals (i.e., observed - expected) for the empirical datasets relative to the 
simulated iterative METE (a) and RPM (b) across all spatial lags and grains. A lowess smoother 
was applied to the residuals for each site. The datasets are color coded by their habitat type. The 
dashed line indicates residuals of zero. 
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Fig A8. The arithmetic one-to-one plots based on medians (rather than averages) of the predicted 
and observed abundance-based S0rensen similarity values for the three models across all 
distances and spatial grains. The solid line is the one-to-one line. The grey points represent 
values from spatial grains in which the average individual density was low (i.e., less than 10 
individuals) and thus both the observed and predicted similarities must be close to zero simply 
because of a sampling effect. 
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Fig A9. Kernel density plots of the estimated intercept and slope of the power law weighted 
regression models. The black line illustrates the results for METE with the fixed SAD and the 
grey line illustrates the observed empirical results. The METE predicted curves had lower 
intercepts and slopes compared to the empirically observed patterns. 



Appendix B 

Analytical derivation of community similarity predicted by the semi-recursive formulation 
of METE and comparison with simulated results 

The MaxEnt solution to II has a simple analytical form for a special case in which the 
area A is half the total area Ao (Harte 2011, Eq. 7.51): 

n(„|A„ o) = _L ( B1) 
2 l + n o 

Equation Bl demonstrates that METE implicitly assumes there is an equal likelihood that any 
number of individuals between 0 and no will be placed in one of two halves of Ao or in other 
words that all spatial arrangements of no individuals between the two cells are equally likely. 
This prediction is identical to the assumption of the HEAP model (Harte et al, 2005; Harte, 
2007). When recursively applying Eq. Bl to successive bisections of Ao (Ao/2, Ao/4, Ao/2') 
the METE prediction for the n distribution is given by HEAP (Harte, 201 1): 

U(n\A i ,n 0 ) = f j U( ^ Ai -f 0 \ i = l,2,3,... (B2) 

q=n (0 + 1) 

where A; is defined as Ao/2 ! . Additionally, Harte (2007) derived the equations needed to compute 
the presence-absence based S0rensen similarity index for HEAP using only the abundance of 
each species at the anchor scale (Ao). Specifically, Harte (2007) derived the species-level 
commonality function, %(no\Ai, Dj, Ao), which is the probability that a species with no individuals 
is present in two A; cells separated by the j'th bisection but not the j - 1st bisection. 

X(n 0 1 A ; ,D.) = (n 0 + 1)" 1 £ X (n I 4_ p £,-_,), i = 1, 2, 3, j =1, 2, 3,... i (B3) 

n=2 

and when 7=1, 



x(n 0 1 a, A) = K +ir 1 Z[i-n(oi [i-n(oi A-x^o -«)] (B4) 

Under the METE/HEAP assumption that each species is independently spatially distributed, the 
community-level commonality function (i.e., the presence-absence S0rensen index) is 
approximated by: 

s I s 

x( A ,Dj ) - E xK* i A , ^ ) / E i - n(0 1 a , «<u ) (B5) 

where no,k represents the abundance of the kth species at Aq. Eq. B5 is the ratio of the expected 
number of species in common between two samples and the expected sample richness. It is not 
an exact formulation for the S0rensen index because it treats the numerator and denominator as 
two independent quantities (Plotkin & Muller-Landau, 2002). These results are based on using 
observed species abundances, so to derive a full METE prediction we re-express Eq. B5 in terms 
of the METE-SAD, O(nol So, No) at the anchor scale Ao yielding: 

XCA^.^Zfc^A'^ (B6) 

n=\ I n=\ 

To connect METE's spatially explicit predictions with data, distance is treated as a discrete 
variable that is defined by the geometry of the bisections (Fig 1, Appendix A, Fig A2). 
Specifically, Dj is the average distance between two Ai cells that are separated by the j'th but not 
the j - 1st bisection where \ <j<i (Ostling et ai, 2004; Harte, 2007). We will refer to Eqs. B5 
and B6 as the semi-recursive versions of METE. 

In Fig Bl, we compare the analytical and simulated forms of the distance decay 
relationship (DDR) for the presence-absence S0rensen index as well its two components: the 
numerator (i.e., average number of species) and the denominator (i.e., shared number of species). 
This comparison demonstrates that these quantities are closely approximated by Eq. B5; 



however, the accuracy of the approximation will decrease as sample area approaches the total 
area and when there are few individuals on the landscape such that many of the pairwise sample 
comparisons are undefined (i.e., when the denominator of the S0rensen index is zero). 
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Fig Bl. The analytical approximation (Eq. B5, blue line) and the average across 200 simulated 
communities (red line with pink 95%confidence interval) for the semi-recursive METE species- 
area relationship (left panel), the distance decay relationship of shared species richness (middle 
panel), and the distance decay relationship of community similarity (right panel). The input 
parameters were So = 100 and No = 5000 with a uniform SAD such that each species had 50 
individuals on a square landscape with 256 cells (i.e., 8 bisections). 
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Appendix C 

Proof that the multi-cell recursive II distribution is scale consistent 

The recursive formulation of the marginal II distribution for METE is given by (Harte, 201 1): 

n(nlA,no) = Z n(g ' A - 1 f o) ; i = l,2,3,... (CI) 

q=n (9 + 1) 

Conlisk et al. (2007, Eq. 2.2) provides the multi-cell generalization of CI: 

f<NIA.VC = 2.#=05)= n y^'^ <C2) 

where, a = (l-^)/^=l such that the following simplifications can be made: F(l,n)= 1 and 
F(2, n) = 1 + n. Thus C2 simplifies to: 

P(N\A i ,n o ,c = 2,0 = O.5) = 1 (C3) 

(« L ,« R )eS(N) H L +U R +1 

In the above equations, N is the spatial configuration of cells across the landscape containing the 
no individuals, m and tir are the abundances on the left and right side a given bisection, 5(N) is 
the set of bisections leading to landscape N, c = 2 indicates that a bisection scheme is being 
used, and <f> = 0.5 indicates the aggregation parameter is such that the equal-probability rule 
predicted by METE is in effect. The key to recognizing that Eq. C3 is scale-consistent (sensu 
Haegeman & Etienne, 2010) is that it is only defined over 5(N). In other words, the sum of the 
probabilities of the configurations of no individuals at a fine-scale grid with Mi cells when 
considered on a coarse- scale grid of Mi cells is equal to the probability of the configurations 
solved for directly at Mi (in the context of bisections, Mi = IMi where l = T ,i = 1,2,3,. . .). To 
prove that this is generally true we consider no individuals distributed across a landscape that is 
partitioned into a fine grid with M\ =4 cells and a coarse grid Mi = 2 cells. Consider the 
probability that n individuals occur in only the left quadrat of the Mi cells (i.e., m = n,nR = no-n 



where n e {0, ... , no}). There are n + 1 configurations (i.e., unique landscapes N) of the 
unlabeled individuals in the M\ cells that are consistent with n individuals occurring in only the 
left side of the Mi grid - the probability of each of these configurations is given by: 

P(N) = P(n L = n,n R =n Q -n)- P(n L = j,n R =n- j) (C4) 

Where the first term on the RHS is the probability of observing n individuals on the left hand 
side of Mi and the second term on the RHS is the probability for a given distribution of the n 
individuals across the two Mi cells with abundance j and n -j within the left Mi cell. Using Eq. 
C3 we can express C4 as: 

P(N) = 1 1 = 1 (C5) 

n + (n 0 -n) + l j + (n-j) + l (n 0 +l)(n + l) 

If we scale-transform this result to compute the probability of observing n individuals only on 
the left of the Mi grid we must sum across the probabilities of the n + 1 possible configurations 
which is: 

S P W = ; ^ n - K + 1)" 1 (C6) 

(n L ,n R )*Sm K+ 1)0 + 1) 

Equation C6 is identical to the directly calculated probability of observing n individuals on the 
left half of the Mi grid which proves that the Eq. C3 is scale consistent. 
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1 Appendix D 

2 Code Supplement 

3 We have provided all of the code to replicate the analyses on figshare: 

4 http://dx.doi.org/10.6084/m9.figshare.978918 . The file mete-spatial.zip should be downloaded 

5 and decompressed in what R recognizes as the home directory (the R code will need to be 

6 modified if you decide not to use your home directory). R v 2. 12 or higher and python 2.6 are 

7 needed to run the code. The following python packages are required: matplotlib, mpmath, 

8 numpy, and scipy, and the following R packages are required: vegan, hash, RCurl and 

9 bigmemory. 

10 It is also possible to access the code via GitHub at the following addresses: 

1 1 The primary code repository is located at: 

12 http s : //github . com/weecolog y/mete- sp atial 

13 Additional scripts needed to run the core METE DDR functions are located here: 

14 http s : //github . com/weecology/METE 

15 https://github.com/weecology/macroecotools 

16 After the files are downloaded (from figshare or GitHub) and decompressed (if using the mete- 

17 spatial.zip file) navigate to the METE directory and run the following python command 

18 python setup .py install 

19 run the same command in the macroecotools directory. 

20 To download the publically available data and run the analysis navigate to the directory ~/mete- 

21 spatial and run the command: 

22 Rscript ddr_run_all.R 

23 This script will download two datasets, analyze them, and graph the results. The plots will appear 

24 in the directory -/mete- spatial/figs/ 

25 



